Modification of CHF and BIC coefficients for Evaluation of Clustering with Mixed Type Variables
نویسندگان
چکیده
Cluster analysis is a multivariate statistical method, which is used to classify objects. It is used in many areas, such as the classification of customers or respondents in various marketing surveys. Individual objects are characterized by different variables. Variables can be quantitative and qualitative. Depending on the type of variables it is necessary to select the appropriate method of measuring distances of objects and clusters. There are many ways how to measure these distances and it is not clearly defined how to choose specific measure in different conditions. Depending on the extent of distances and the method chosen may arise different clusters, and thus different results. For this reason, it is necessary to evaluate the clustering result. The evaluation should analyze the numbers of clusters and different clustering methods. There are many coefficients for evaluate results of clustering. In the current literature are defined in particular coefficients, which are used for the quantitative variables. For variables of mixed types (a combination of qualitative and quantitative) are coefficients described only in a very limited extent. The aim of this paper is to analyze the modified coefficients CHF and BIC on real data sets in case of mixed types variables.
منابع مشابه
Mixture-model cluster analysis using information theoretical criteria
The estimation of mixture models has been proposed for quite some time as an approach for cluster analysis. Several variants of the Expectation-Maximization algorithm are currently available for this purpose. Estimation of mixture models simultaneously allows the determination of the number of clusters and yields distributional parameters for clustering base variables. There are several informa...
متن کاملThe validity and reliability of the Brief Fear of Negative Evaluation Scale in women with tension-type headaches
Aim and Background: Examining fear of negative evaluation as one of psychological causes in headaches is important. The aim of the present study was to investigate validation and psychometric properties of the Brief Fear of Negative Evaluation Scale (BFNES, Leary, 1983) in a group of women with tension-type headaches. Methods and Materials: A total of 110 women with tension-type headaches in ...
متن کاملModification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملMean Activity Coefficients Measurements and Thermodynamic Modeling of the Ternary Mixed Electrolyte KCl + Lactose + Water System at T = 298.15 K
In this work, the mean activity coefficients of KCl in the KCl+lactose +water system were determined using the potentiometric method. The electromotive force measurements were carried out on the galvanic cell without liquid junction of the type: Ag|AgCl|KCl (m), lactose (wt.%), H2O (1−wt.) %|K-ISE, in various mixed solvent systems containing 0, 5,7.5, 10 and 12.5 % mass fractions of lactose. Th...
متن کاملCost Effective Heat Exchanger Network Design with Mixed Materials of Construction
This paper presents a simple methodology for cost estimation of a near optimal heat exchanger network, which comprises mixed materials of construction. Intraditional pinch technology and mathematical programming it is usually assumed that all heat exchangers in a network obey a single cost model. This implies that all heat exchangers in a network are of the same type and use the same mate...
متن کامل